
what is claimed is: 

1. A method predicting a scaffold of a protein comprising 
a query sequence , 

wherein said mfethod uses a database which contains 
environmental information on the side chain of each amino acid residue 
contained in the amino Acid sequence of each reference protein whose 
three-dimensional struoture is known or predictable, 
and wherein said method \comprises the step of: 

conducting matching based on the environmental information 
on each amino acid residue on each reference protein and hydrophobici ty 
or hydrophilici ty property oY the side chain of each amino acid residue 
of the query sequence, and 

choosing at least one \template protein among the reference 
proteins that has high similarity in three-dimensional structure 
to the protein comprising the \query sequence. 

2. The method according to claim 1, wherein the amino acid 
sequence of each of the reference proteins is divided into two or 
more segment sequences comprising two or more continuous amino acid 
residues based on the three - dimens^ional structure of the reference 
protein . 

3. The method according to cVlaim 2, wherein the amino acid 
sequence of each of the reference proteins is divided into one or 
more core segment sequences which substantially participate in the 
formation of hydrophobic core, and into one or more sub segment 
sequences which do not substantially participate in the formation 
of hydrophobic core. 

4. The method according to any uin\ ul ^claimjf 1 tsc i > , wherein 
the matching is conducted based on the Information on the degree 
of burial into the inside of the protein or the degree of exposure 
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to the protein surface of the side chain of each amino acid residue 
in the reference protein, together with the properties of 
hydrophobicity or hydrophil ici ty of each amino acid residue in the 
query sequence. 

5. The method according to -emy— €mipu of, claim 1 to -^h wherein 
the matching is perf ormedby sliding one or more core segment sequences 
of the reference protein on the query sequence without consideration 
of any gaps except those at one end or both ends of the core segment 
sequences . 

6 . The method according to claim 5 , wherein the gap is a deletion 
or addition of one or more amino acid residues . 

7. The method according to aui* uiitr Uf" claim^ 1 - to , 6 ., wherein 
the matching comprises the steps oty 

(a) the step of matching by/sliding one or more core segment 
sequences on the query sequenc^ optionally considering gaps at one 
end or both ends of the co^re segment sequences, provided when two 
or more core segment se^^uences are used, the core segment sequences 
are placed in the o^er of appearance on the amino acid sequence 
of the referenc^r protein; and 

(b) sui^equent to the above step (a) , the step of sliding one 
or more 9^ segment sequences on the query sequence, optionally 
consi/^ering one or more gaps. 

8. The method according to an^ una ofr claimi^ 1 tMO 7 * wherein 
the optimum matching is selected based on calculated scores obtained 
from the environmental information on the side chains of the amino 
acid residues of the reference protein and the hydrophobicity 
parameters of the corresponding amino acid residues on the query 
sequence . 

9. The method according to claim 8, which further comprises 
the step of normalizing the above scores by using a self matching 
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score for the reference protein. 

10. The method according to le m y-onfa. n £ claim^ 1 which 
further comprises the step of constructing the three-dimensional 
structure of the protein comprising the query sequence. 

11. A database which contains environmental information on 
the side chains of amino acid r^e^dues of one or more reference proteins 
with known or predictatoire three-dimensional structures, and which 
is used for the ro^hod according to ^ny ono o£ -claimjl 1 to 10. 
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